$ε$-Kernel Coresets for Stochastic Points
نویسندگان
چکیده
With the dramatic growth in the number of application domains that generate probabilistic, noisy and uncertain data, there has been an increasing interest in designing algorithms for geometric or combinatorial optimization problems over such data. In this paper, we initiate the study of constructing ε-kernel coresets for uncertain points. We consider uncertainty in the existential model where each point’s location is fixed but only occurs with a certain probability, and the locational model where each point has a probability distribution describing its location. An εkernel coreset approximates the width of a point set in any direction. We consider approximating the expected width (an ε-exp-kernel), as well as the probability distribution on the width (an (ε, τ)-quant-kernel) for any direction. We show that there exists a set of O(ε−(d−1)/2) deterministic points which approximate the expected width under the existential and locational models, and we provide efficient algorithms for constructing such coresets. We show, however, it is not always possible to find a subset of the original uncertain points which provides such an approximation. However, if the existential probability of each point is lower bounded by a constant, an ε-exp-kernel is still possible. We also provide efficient algorithms for construct an (ε, τ)-quant-kernel coreset in nearly linear time. Our techniques utilize or connect to several important notions in probability and geometry, such as Kolmogorov distances, VC uniform convergence and Tukey depth, and may be useful in other geometric optimization problem in stochastic settings. Finally, combining with known techniques, we show a few applications to approximating the extent of uncertain functions, maintaining extent measures for stochastic moving points and some shape fitting problems under uncertainty. 1998 ACM Subject Classification B.2.4 Algorithms, F.2.2 Geometrical problems and computations
منابع مشابه
eps-Kernel Coresets for Stochastic Points
7 We study the problem of constructing ε-kernel coresets for uncertain points. We consider uncertainty8under the existential model where each point’s location is fixed but only occurs with a certain probability,9and the locational model where each point has a probability distribution describing its location. An ε-10kernel coreset approximates the width of a point set...
متن کاملepsilon-Kernel Coresets for Stochastic Points
With the dramatic growth in the number of application domains that generateprobabilistic, noisy and uncertain data, there has been an increasing interest in designingalgorithms for geometric or combinatorial optimization problems over such data. Inthis paper, we initiate the study of constructing ε-kernel coresets for uncertain points.We consider uncertainty in the existenti...
متن کاملNear-Optimal Coresets of Kernel Density Estimates
We construct near-optimal coresets for kernel density estimate for points in Rd when the kernel is positive definite. Specifically we show a polynomial time construction for a coreset of size O( √ d log(1/ε)/ε), and we show a near-matching lower bound of size Ω( √ d/ε). The upper bound is a polynomial in 1/ε improvement when d ∈ [3, 1/ε2) (for all kernels except the Gaussian kernel which had a ...
متن کاملSmall and Stable Descriptors of Distributions for Geometric Statistical Problems
This thesis explores how to sparsely represent distributions of points for geometric statistical problems. A coreset C is a small summary of a point set P such that if a certain statistic is computed on P and C, then the difference in the results is guaranteed to be bounded by a parameter ε. Two examples of coresets are εsamples and ε-kernels. An ε-sample can estimate the density of a point set...
متن کاملImproved Coresets for Kernel Density Estimates
We study the construction of coresets for kernel density estimates. That is we show how to approximate the kernel density estimate described by a large point set with another kernel density estimate with a much smaller point set. For characteristic kernels (including Gaussian and Laplace kernels), our approximation preserves the L∞ error between kernel density estimates within error ε, with cor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1411.0194 شماره
صفحات -
تاریخ انتشار 2014